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Abstract. 

Here we describe the Red MSX Source (RMS) survey which is the largest, 
systematic, galaxy-wide search for massive young stellar objects (MYSOs) yet 
undertaken. Mid-IR bright point sources from the MSX satellite survey have 
been followed-up with ground-based radio, millimetre, and infrared observations 
to identify the contaminating sources and characterise the MYSOs and UCHII 
regions. With the initial classification now complete the distribution of sources 
in the galaxy will be discussed, as well as some programmes being developed to 
exploit our sample. 



1. Introduction 

Massive stars (M>8 M , L>10 4 L Q ) play a fundamental role in many areas of 
astrophysics. They are the principal source of UV radiation and heavy elements 
in galaxies, and are responsible for injecting huge amounts of kinetic energy 
into the ISM through powerful molecular outflows, strong stellar winds and su- 
pernova explosions. The momentum imparted through these processes provides 
an important source of mixing and turbulence within the ISM. They are also 
thought to play a key role in regulating star formation, either by disrupting 
molecular clouds before stars have been able to form, or constructively through 
the expansion of their HII regions (e.g., collect and collapse) or the propaga- 
tion of strong shocks into their surroundings (e.g., radiatively driven implosion). 
Massive stars have an enormous influence not only on the physical structure and 
chemistry of their local environment, but also on the structure and evolution of 
their host galaxies. 

Despite the importance of massive stars the processes involved in their 
formation and the early stages of their evolution are still poorly understood. 
Massive stars reach the main sequence while still deeply embedded within their 
natal molecular clouds, and therefore their entire pre-main sequence formation 
and evolution is hidden behind high levels of extinction. They are extremely 
rare and as a consequence are generally located farther away than regions of 
low-mass star formation. The large distances are compounded by the fact that 
massive stars are through to form exclusively in clusters and limited spatial reso- 
lution makes it difficult to identify and attribute derived quantities to individual 
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sources. Furthermore, the evolution of massive stars is extremely rapid, which 
means that key stages in their early evolution are short lived. Due to these 
observational difficulties, until relatively recently, the only catalogue of massive 
young stellar obj ects (MYSOs) had been limited to 30 or so serendipitously 
detected sources ( Henning et al.|["l984l ) most of which are nearby. 



2. Colour-selection of MYSO candidates 

As massive stars reach the main sequence while still deeply embedded, the ob- 
vious place to begin to search for MYSOs is at inf rared wavelengths wh ere 
the majority of the bolometric luminosity (>10 4 L Q ; IWvnn- Williams! Il982l ) is 
emitted after reprocessing by dust. Nuclear fusion has almost certainly be- 
gun due to the very short Kelvin-Helmholtz contrac tion time scale compared 
to their free-fall time scale ( Behrend &: Maeder 20011 ). Accretion is likely to be 



taking place as the majority of MYSOs are associated with massive outflows 
( Wu et all 120041 ) . generally thought to be powered by accretion. Although fu- 



sion is taking place, the ongoing accretion is thought to impede the formation of 
an HII region either by: 1) significantly increasing the star's radius, leading to a 
lower effective stellar te mperature and characteristic accretion disk temperature 
(Hoa re & Franco I I2007T) . or 2) the HII region being quenched by high infall rates 
(|Walmslevl 1 19951 ) . MYSOs are also known to posse ss ionised ste llar winds that 



are weak thermal radio sources (~1 mJy at 1 kpc; iHoare 1 12002| ) . MYSOs can 
be roughly parametrised as mid- infrared bright, and thus slightly later than the 
hot core phase, and radio quiet, and thus earlier than the UCHII region phase, 
with luminosities consistent with young O and early B-type stars. 

To date there have been several attempts to search for MYSOs using colour- 
selection criteria and the IRAS point sou rce catalogue (e.g.. lMolinari et ai1ll996l ; 



Walsh et al.lll997l ; ISridharan et al.ll2002l ). However, due to confusion in the large 



IRAS beam (~3-5' at 100 fim) these samples tended to avoid dense clustered 
environments and the Galactic mid-plane where the majority of MYSOs ar e 
expected to be located since the scale height of massive stars is ~30' (|Reedll2000l ). 



Although these IRAS selected samples have identified many genuine MYSOs, 
they tend to be biased towards bright, isolated sources that are probably not 
representative of the general population of MYSOs. There is a need for an 
unbiased sample of MYSOs that is large enough to have a sufficient number of 
sources in each luminosity bin to allow the processes involved in massive star 
formation to be investigated in a statistically r obust way. 



The Midcourse Space experiment (MSX; iPrice et ail l200ll ) surveyed the 



whole Galactic plane (|fe|<5°) in four mid-infrared bands centred at 8, 12, 14 
and 21 fim at a resolution of 18" in all bands. Although the sensitivity of MSX 
is similar to that of IRAS, its beam size is ~50 times smaller than that of the 
IRAS 12 and 25 /iin bands and thus avoids most of the confusion problem found 
with IRAS. The MSX survey therefore offers us the opportunity, for the first 
time, to identify a truly representative sample of MYSOs located throughout 
the Galaxy. More recently t he Spitzer satellite ha s surveyed a large region of 
the inner Galaxy (GLIMPSE; [Benj amin et "aTll2005l ) with unprecedented spatial 
resolution and sensitivity, offering a great improvement on MSX data. However, 
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most MYSOs are found to be highly saturated and would be missed in any 
GLIMPSE colour-selected sample. 

In order to select potential MYSOs we compared the colours of known 
MYSOs with sour ces i dentified in the M SX and 2MASS point source catalogues 
( Egan et al. 2003 and Cutri et alj|2003l respectively) and developed colour se- 
lection criteria (jLumsden et al.ll2002l ). A lower limit of 2.7 Jy was placed on the 
21 jira flux. The detection of a source in 2MASS was not necessary as MYSOs 
can be totally obscured in the near-infrared; these data were just used to exclude 
sources that are too blue. The MSX images of all colour selected sources were vi- 
sually inspected to eliminate sources that are not point-like (see below). Sources 
toward the Galactic centre (|/| < 10°) were excluded due to confusion and diffi- 
culties in calculating kinematic distances. In total we identified approximately 
2000 MYSO candidates. 



3. Multi-wavelength follow-up observations 

The shape of the spectral energy distribution from an optically thick cloud is 
insensitive to the type of heating source. This is a problem with a colour-selected 
sample such as ours, as it results in contamination of our sample by several other 
types of embedded, or dust enshrouded objects that have similar near- and mid- 
infrared colours to MYSOs (e.g., ultra compact (UC) HII regions, evolved stars 
and planetary nebulae (PNe)). The core of the RMS survey is a multi- wavelength 
programme of follow-up observations designed to distinguis h between genuine 
MYSOs and these other types of embedded or dusty objects ( Hoare et al.ll2005l ) 



and to compile a database of complementary multi-wavelength data with which 
to study their properties. 

The main source of contamination of our sample is likely to be by UCHII 
regions. These can easily be identified through the free-free emission emitted by 
their ionised nebular gas. Radio continuum observations are thus an essential 
part of our follow-up program me. We have complete d a programme of 5 GHz 
observations using the ATCA (|Urauhart et al.l l2007h and the VLA (Urquhart 



et al. in prep.) of all MYSOs candidates. These observations have a spatial 
resolution of 1" and a sensitivity of 1 mJy (3a), sufficient to d etect a B0. 5 or 
earlier type star on the far side of the Galaxy ( Giveon et al .11 200*51 ) . About 25% of 



RMS sources have detectable radio emission, the majority of which are classified 
as UCHII regions, due to their morphologies and tight correlation around the 
Galactic mid-plane. 

Mid-infrared imaging has been used to identify genuine point sources from 
more extended emission . Given the typical size of MYSOs at 10 ^m is 0.005 pc 
( Churchwell et al. 19901 ) and a spatial resolution of 1" we would not expect to 



be able to resolve a MYSO at distances > 1 kpc. However, UCHII regions with 
a typical size of ~0.1 pc would be resolved at a distance of 15 kpc (~1.4") as 
would most PNe. Therefore, mid-infrared imaging can prove extremely useful 
in identifying, and removing extended mid-infrared objects. Moreover, mid- 
infrared imaging is complementary to the radio continuum observations and can 
be used to obtain accurate astrometry and avoid excluding MYSOs that are 
located in close proximity to UCHII regions. For the majority of our sources 
the GLIMPSE data have been used for this purpose, however, the ~700 sources 
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located outside the GLIMPSE region hav e been observed using UKIRT, Gemini 



and the ESO 3.6 m telescope at La Silla (jMottram et al. 200 



ng l 
22). 



To obtain kinematic distances we have made molecular line observations 
towards every source using 13 CO (J=2-l) and (J=l-0) transitions and Mopra, 



Onsa la and Purple Mountain Observatory telescopes, and the JCMT (jUrquhart et al 



20071 ; Urquhart et al. submitted to A&A). We have complemented these ob 



ser vations with archiva l data extracted from the Galactic Ring Survey (GRS; 
see iJackson et all 2006 for details). In cases where multiple CO components 
are detected towards an RMS source we have used a combination of nearby 
methanol and water masers, or CS (J=2— 1) observations to identify the com- 
ponent associated with the MYSO candidate. We hav e calculated kinemati c 
distances using the source vlsr and the rotation curve of lBrand & Bht3 (|1993H : 
these are crucial for calculating luminosities and thus distinguishing between 
nearby low- and intermediate-mass YSOs from genuine MYSOs. The major- 
ity (~80%) of our sample is located within the solar circle and therefore the 
rotation curve produces two possible kinematic distances, referred to as the 
'near' and 'far' distances; this is known as the kin ematic distance am bigu- 
ity. We have used the HI self-absorption technique ( Jackson et al.| [2002) and 
HI data from the International Galactic Plane Survey (IGPS; for details see 



http://www.ra s.ucalgary.ca/IGPS/ ) to resolve these distance ambiguities (e.g., 
Busfield et afll200d ). 



While the observations just discussed are useful in eliminating the vast ma- 
jority of UCHII regions, PNe and low- and intermediate-mass YSOs, they are 
not so effective at eliminating evolved stars. We have eliminated most of these 
using near infrared images in combination with other available data since they 
tend to appear as isolated point sources at all wavelengths, and are not gener- 
ally associated with strong CO emission ( 12 CO (J=l-0 and J=2— 1) typically 



less than 1 K; Loup et al. 19931 ). In contrast, near-IR images of MYSOs often 



show associated shocked emission from outflows, reflection nebulosity, extinction 
lanes, as well as clusters of lower mass YSOs. 

The combination of near- and mid-infrared imaging, radio continuum and 
molecular line observations allows us to eliminate most of the contaminating 
sources. However, there are a few weak UCHII regions, or chance alignments of 
evolved stars with molecular material, that are not identified. To weed out any 
remaining contaminating sources, and to confirm the identifications of our final 
sample of MYSOs, we are obtaining ne ar-infrared spectro scopy from which a 
definitive classification can be made (e.g., Clarke et al. 20061 ). We have obtained 
spectra towards approximately 350 RMS sources so far. 



4. Results 

As the observational data have become available they have been used to classify 
the sources in our sample. So far we have classified all but a few of the ~2000 
sources and have identified ~450 YSOs and a further ~500 UCHII regions (see 
Fig. Q] for a detailed breakdown). Along with the YSOs and UCHII regions 
so far identified, a further ~200 sources have been identified as either YSOs 
and/or UCHII regions. We are waiting on additional data before making a final 
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Figure 1. Classification and distribution of RMS sources identified so far 
using data obtained from our multi-wavelength observations. 




Figure 2. Left panel: Histogram of the luminosities for sources identified as 
either YSOs or UCHII regions and for which good IRAS dat a was a vailable. 
These luminosities have been calculated following lEmersonl (|1988f) . Right 
panel: Distribution of source luminosities as a function of heliocentric dis- 
tance. 



In the left panel of Fig. [2] we present a histogram of the luminosities of 
sources that have been classified as either YSOs or UCHII regions and for which 
good quality IRAS data are available (~25%), and for which any distance am- 
biguities have been resolved. In the right panel of Fig. [2] we plot the distribu- 
tion of luminosities as a function of heliocentric distance. These plots reveal 
that a number of the sources identified as YSOs are actually nearby low- and 
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intermediate-mass YSOs. However, the vast majority have luminosities consis- 
tent with late O and early B-type stars. 




-10 10 
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Figure 3. Galactic distribution of RMS sources identified as either UCHII 
regions or YSOs . The location of the spiral a r ms ta ken from model by 
iTavlor fc Corded (|1993l ) and updated by ICordesI (|2004t ) are over-plotted in 
grey. The positions of the Galactic centre and Sun are shown by black filled 
and unfilled circles respectively. The Roman numerals in the corners refer 
to the Galactic quadrants and the two thick black lines which originate from 
the location of the Sun enclose the region excluded from our survey. The two 
dashed circles indicate the solar circle and tangent points. 

We have determined kinematic distances for all YSOs and UCHII regions, 
for which an una mbigu ous vlsr could be attributed, using the rotation curve 
of iBrand fc Blit3 <|l993l h Using these distances and source positions in Fig. [3] 



we plot the distribution of YSOs and UCHII regions on a face-on view of the 
Galactic plane. On this plot we have indicated the positions of the Galactic 
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centre, the Sun, the solar circle and the spiral arms (see figure caption for de- 
tails). Massive stars are thought to be primarily associated with the spiral arms, 
and thus could potentially be used as a probe of Galactic structure. Comparing 
the distri bution of our sources and the position of the spira l arms (from the 
model bv lTavlor fc Corded fl993l and updated bv lCorded 12001 ) we find them to 
be reasonably well correlated with each other, especially taking into account the 
errors in the kinematic distances (typically ±1 kpc). The correlation is partic- 
ularly apparent towards the inner Galactic arms (i.e., 1st and 4th quadrants), 
but quite poor for sources in the outer Galaxy. 



5. Summary 

The RMS survey aims to produce a large, unbiased sample of massive young stel- 
lar objects (MYSOs) located throughout the Galaxy. We have colour-selected 
approximately 2000 MYSO candidates from the MSX and 2MASS point source 
catalogues that have colours similar to known MYSOs. We have almost com- 
pleted a multi-wavelength programme of follow-up observations to distinguish 
between genuine MYSOs and types of embedded or dust enshrouded objects 
that have similar mid-infrared colours that contaminate our sample. So far us- 
ing these observations we have unambiguously classified ~90% of our sample, 
approximately half of which are identified as either YSOs and UCHII regions. 
The luminosities of a sub-sample of these sources, for which good IRAS data 
are available, confirms that we are detecting a significant number of genuine 
MYSOs. When complete, we estimate our database will contain a well selected 
sample of ~500 bona fide MYSOs and a further 500-600 UCHII regions, along 
with complementary multi- wavelength observations. 

Now the classification is almost complete we have begun a number of obser- 
vational programmes to exploit our sample and to address many open questions 
concerned with massive star formation. These include: high resolution observa- 
tions of potentially triggered regions to investigate different triggering mecha- 
nisms; chemical surveys using millimetre lines and IR absorption features to in- 
vestigate the chemistry and possible chemical evolution of MYSOs; investigation 
of molecular outflows associated with MYSOs; and high resolution spectroscopy 
of accretion disks. All of these programmes are being conducted on sub-samples 
selected as a function of luminosity, distance and location. The whole sam- 
ple and all of the observational results of our multi-wavelength campaign are 
available at www.ast.leeds.ac.uk/RMS. 
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